InDaQu: Enabling user-centered definition and exchange of consistency constraints for data cleaning

نویسندگان

  • Stefan Brüggemann
  • Yvette Teiken
  • Hans-Jürgen Appelrath
  • S. Brüggemann
  • Y. Teiken
  • H.-J. Appelrath
چکیده

Severe data quality problems exist in most public health care systems and inconsistent data sets often occur. Consistency constraints can be used to define valid and invalid data. Existing solutions of such constraints like rule systems are often difficult to maintain, not human-readable, and of a bad quality like containing contradictory rules. With InDaQu we present an approach that allows domain experts to easily create and maintain consistency constraints using an introduced domain-specific language. These constraints are being stored in an ontology, which allows for an automated inconsistency detection in the defined rules themselves. We identified several scenarios in which consistency constraints can be interchanged and exchanged between different participants. The approach has been successfully evaluated in the cancer registry of Lower Saxony.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learner-Centered Education in the Iranian EFL Context: A Glance through the Impediments

Though learner-centered paradigm of education has long been introduced to pedagogy in general and language teaching in particular, it seems that scant heed has been given to its implementation as well as the restrictions and challenges on its way. In an attempt to shed more light on the status of learner-centered instruction in Iranian language schools, particularly as regards the imped...

متن کامل

Enabling User Preferences through Data Exchange

This paper describes the application of data exchange, for integrating user and air traffic management (ATM) systems, to enable user preferences for en-route flights. User preferences may be defined in terms of a fourdimensional (4D) user-preferred trajectory, or a series of profile constraints (e.g., speed, routing, time), depending on user capability. Deviations from the user’s preference are...

متن کامل

Design of modern interactive and ergonomic home air purifier

Introduction: The subject of this research is having healthy air and its challenge is air purification to have this type of air. Healthy air is free of any pollutants, including odors, harmful gasses, dust, and viruses, especially corona. This healthy air is provided by a purifier device. One of the problems of metropolises is the lack of healthy air, which is one of the most important human ne...

متن کامل

A Study on Application of user Centered Design For Inteior Design of Travel Bus

This study tries to redesign the interior design of inter-city bus in order to fulfill needs of Iranian User. The goal of this study is practically investigate how user centered design can be applied considering cultural needs of Iranian user. By defining common needs between cultural and physical aspects of Iranian user, the main focus was on improving the sitting condition of the traveler wit...

متن کامل

Rank-based strategies for cleaning inconsistent spatial databases

A spatial dataset is consistent if it satisfies a set of integrity constraints. Although consistency is a desirable property of databases, enforcing the satisfaction of integrity constraints might not be always feasible. In such cases the presence of inconsistent data may have a negative effect on the results of data analysis and processing and, in consequence, there is an important need for da...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010